智能论文笔记

The Common Voice corpus is a massively-multilingual collection of transcribed speech intended for speech technology research and development. Common Voice is designed for Automatic Speech Recognition purposes but can be useful in other domains (e.g. language identification). To achieve scale and sustainability, the Common Voice project employs crowdsourcing for both data collection and data validation. The most recent release includes 29 languages, and as of November 2019 there are a total of 38 languages collecting data. Over 50,000 individuals have participated so far, resulting in 2,500 hours of collected audio. To our knowledge this is the largest audio corpus in the public domain for speech recognition, both in terms of number of hours and number of languages. As an example use case for Common Voice, we present speech recognition experiments using Mozilla's DeepSpeech Speech-to-Text toolkit. By applying transfer learning from a source English model, we find an average Character Error Rate improvement of 5.99 ± 5.48 for twelve target languages (German, French, Italian, Turkish, Catalan, Slovenian, Welsh, Irish, Breton, Tatar, Chuvash, and Kabyle). For most of these languages, these are the first ever published results on end-to-end Automatic Speech Recognition.

translated by 谷歌翻译

金属有机框架（MOF）是一类模块化的多孔晶体材料，具有巨大的革命性应用，例如储气，分子分离，化学感应，催化和药物输送。剑桥结构数据库（CSD）报告了10,636个合成的MOF晶体，此外还包含CA。114,373个类似MOF的结构。综合数量（加上可能合成的）MOF结构数量庞大，需要研究人员追求计算技术来筛选和分离MOF候选物。在此演示论文中，我们描述了我们在利用知识图方法方面促进MOF预测，发现和综合方面的努力。我们提出了有关（1）从结构化和非结构化来源构建MOF知识图（MOF-KG）的挑战和案例研究，以及（2）利用MOF-KG来发现新知识或缺失知识。

translated by 谷歌翻译

Effect of Prior-based Losses on Segmentation Performance: A Benchmark

Rosana {EL JURDI} , Caroline Petitjean , Veronika Cheplygina , Paul Honeine , Fahed Abdallah

分类：人工智能 | 计算机视觉

2022-01-07

如今，深度卷积神经网络（CNNS）对医学图像分割的最先进的性能，在各种成像方式和任务上。尽管已经提早成功，分割网络可能仍然产生解剖学上异常的分割，并且在物体边界附近具有孔或不准确的孔。为了实施解剖学合理性，最近的研究研究专注于将现有知识（例如对象形状或边界）掺入，作为损耗功能的约束。之前的集成可以是低级，参考从地面真理分割中提取的重新表达，或者高级代表外部医疗信息，例如器官的形状或大小。在过去的几年里，基于事先的损失在研究领域的兴趣表现出了兴趣，因为它们允许一体化专家知识，同时仍然是架构 - 不可知论者。然而，鉴于对不同医学成像挑战和任务的先前损失的多样性，它变得难以确定最适合数据集的损失工作。在本文中，我们建立了近期基于医学图像分割损失的基准。主要目的是提供直觉，以便给定特定任务或数据集的损失。为此，选择了四个低级和高级的基于先前的损耗。考虑的损失在8个不同的数据集中验证了来自各种医学图像分割挑战，包括迪卡侬，群岛和WMH挑战。结果表明，虽然低级别的先前损耗可以保证骰子损耗基线的性能提高，但无论数据集特性如何，高级别的先前损耗都可以根据数据特征提高解剖合理性。

translated by 谷歌翻译